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The problem of solving a parity game is at the core of many problems in model checking, satisfiability 
checking and program synthesis. Some of the best algorithms for solving parity game are strategy 
improvement algorithms. These are global in nature since they require the entire parity game to be 
present at the beginning. This is a distinct disadvantage because in many applications one only needs 
to know which winning region a particular node belongs to, and a witnessing winning strategy may 
cover only a fractional part of the entire game graph. 

We present a local strategy improvement algorithm which explores the game graph on-the-fly 
whilst performing the improvement steps. We also compare it empirically with existing global strat- 
egy improvement algorithms and the currently only other local algorithm for solving parity games. It 
turns out that local strategy improvement can outperform these others by several orders of magnitude. 

1 Introduction 

Parity games are two-player games of perfect information played on labelled, directed graphs. They are 
at the core of various problems in computer-aided verification, namely model checking |[T2l . satisfiability 
checking for branching-time logics Q and controller synthesis m. They are also closely related to other 
games of infinite duration, in particular mean and discounted payoff as well as stochastic games |[6l[T2l. 

A variety of algorithms for solving parity games has been invented so far. Among these, strategy 
improvement (SI) algorithms have proved to be very successful both because of their theoretical ele- 
gance as well as their good performance in practice. The name hints at the general method according 
to which these algorithms work. Starting with an initial strategy, they compute an evaluation which ei- 
ther witnesses that the current strategy is winning, or it offers ways to improve the current strategy. SI 
is therefore in fact an algorithm scheme parametrised by a policy which determines how to improve a 
non-winning strategy. The first SI algorithm was given by Jurdziriski and Voge |[T3i which was inspired 
by and improves over Puri's SI algorithms for parity and discounted payoff games 0. Later, Schewe 
presented another SI algorithm for parity games and payoff games ifTOl . 

A disadvantage common to all these algorithms is their global nature: they require the entire game 
graph to be present at the start of the algorithm's execution. They then compute for a given player 
his entire winning region and a corresponding winning strategy. This is a drawback because in many 
applications of parity games taken from program verification one only needs to know for a specific node 
in the game which winning region it belongs to, together with a winning strategy witnessing this. 

As an example, consider the model checking problem for branching-time temporal logics like CTL, 
CTL*, or the modal /i -calculus on finite-state systems for instance. These can easily be regarded as in- 
stances of the parity game solving problem |[T2l l8l in which the nodes are pairs of transition system states 
and (sets of) subformulas of the input formula. Solving the model checking problem means solving the 
parity game locally, i.e. deciding which player wins the node composed of the transition system's initial 
state and the input formula. A global algorithm would compute this information for every (reachable) 
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pair of nodes and (set of) subformulas. In model checking, local solving is usually considered to be more 
desirable for explicitly represented state spaces |f2l. Similarly, the satisfiability problem for branching- 
time logics reduces to the problem of solving parity games Q and, again, in order to decide whether or 
not the input formula is satisfiable one needs to know whether or not player wins a particular node of 
the resulting parity game. The winners of the other nodes are irrelevant. 

It is known that the problem of solving a parity game is equivalent under linear-time reductions to 
the model checking problem for the modal jLt-calculus. There are local model checkers, in particular 
the one by Stevens and Stirling ifTTI which can therefore be rephrased as a parity game solver. It is 
generally outperformed in practice by the global algorithms - when the task is to solve an entire game - 
in particular the SI algorithms 145. It is therefore fair to say that local parity game solvers with reasonable 
efficiency in practice are desirable but do not exist yet. 

Here we present a local SI algorithm scheme for solving parity games. It solves a parity game locally, 
i.e. starting with a given initial node it expands the game graph in an on-the-fly fashion and intertwines 
this expansion with evaluations needed in order to improve cuiTcnt strategies. It terminates as soon as it 
has determined which winning region the initial node belongs to. 

The rest of the paper is organised as follows. Sect. |2] recalls parity games and global SI. Sect. [3] 
presents local SI. Sect. H] reports on empirical comparisons between local SI, global SI, and the local 
model checking algorithm by Stevens/Stirling when rephrased as a parity game solver. We show that 
local SI can outperform the others significantly. 

The contribution of this paper therefore lies in the practical use of parity game solvers. The presented 
algorithm is not meant to improve other algorithms in asymptotic worst-case complexity. Its relevance is 
justified by the improvement in solving parity games in practice which this algorithm can bring along. 

2 Preliminaries 

2.1 Parity Games 

A parity game is a tuple G = {V,Vo,V\,E,D.) where {V,E) forms a directed graph in which each node has 
at least one successor. The set of nodes is partitioned into V = VoLiVi with Vb fl Vi = 0, and : V — > N is 
the priority function that assigns to each node a natural number called the priority of the node. We write 
|n| for the index of the parity game, that is the number of different priorities assigned to its nodes. The 
graph is not required to be total. Here we only consider games based on finite graphs. W.l.o.g. we will 
assume that games are turn-based, i.e. for all v G V and all / € {0, 1} we have: if v € V^- then vE C Vi-j. 

We also use infix notation vEw instead of (v,w) G E and define the set of all successors of v as 
vE := {w I vEw}, as well as the set of all predecessors of w as Ew := {v | vEw}. 

The game is played between two players called and 1 in the following way. Starting in a node 
vo € V they construct a path through the graph as follows. If the construction so far has yielded a finite 
sequence vq . . . v„ and v„ G Vi then player / selects a w € VnE and the play continues with the sequence 
vo . . . v„w. Otherwise, the play ends in a sink v„. 

Every infinite play has a unique winner given by the parity of the greatest priority that occurs in- 
finitely often in a play. The winner of the play voViV2 ... is player / iff max{p \ yj €N 3k> j : ^{vt) = 
p} =2 i (where / =2 j holds iff \i — j\ mod 2 = 0). That is, player tries to make an even priority occur 
infinitely often without any greater odd priorities occurring infinitely often, player 1 attempts the con- 
verse. For finite plays voViV2 . . . v„ that end in a sink, the winner is given by the chooser of v,,. Player / 
wins the play iff v„ G 
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A positional strategy for player / in G is a — possibly partial — function o '.Vj ^V. A play vqVi . . . 
conforms to a strategy a for player / if for all 7 € N we have: if vj G V,- then v^+i = cj(vj). Intuitively, 
conforming to a strategy means to always make those choices that are prescribed by the strategy. A 
strategy a for player / is a winning strategy in node v if player / wins every play that begins in v and 
conforms to G. We say that player / wins the game G starting in v iff he/she has a winning strategy for G 
starting in v. Let ± denote the empty strategy. 

With G we associate two sets Wb,Wi C V; W,- is the set of all nodes v s.t. player / wins the game G 
starting in v. Here we restrict ourselves to positional strategies because it is well-known that a player has 
a (general) winning strategy iff she has a positional winning strategy for a given game. In fact, parity 
games enjoy positional determinacy meaning that for every node v in the game either v G Wq or v GWi 
||3l. Furthermore, it is not difficult to show that, whenever player / has winning strategies a,, for all v G ?7 
for some U '^V, then there is also a single strategy a that is winning for player / from every node in U. 

The problem of (globally) solving a parity game is to decide for every node v G V whether ot not it 
belongs to Wq. The problem of locally solving a parity game is to decide, for a given node v, whether 
or not V belongs to Wq. Note that it may not be necessary to visit all nodes of a game in order to answer 
this question. Consider the following trivial example. The two nodes under consideration are connected 
to each other and have no further outgoing edges. Hence, this simple cycle is clearly won by one of the 
two players. A depth-first search can find this loop whilst leaving the rest of the game unexplored. 

A strategy a for player / induces a. strategy subgame G\a '■= (y,'VQ,'V\:E\fy,Q.) where := {(m,v) G 
I M G V;- (m G dom{a) A o{u) = v)}. The set of strategies for player / is denoted by ^(G). 
Let U C V, / G {0, 1} and a be an /-winning strategy on U . The /-attractor of U is the least set W s.t. 
U QW and whenever v G V^- and vE r\W ^ %, or v ^ Vi_,- and vE (IW then v G W. Hence, the /-attractor 
of U contains all nodes from which player / can move "towards" U and player 1 — / must move "towards" 
U . At the same time, it is possible to construct an attractor strategy which is a positional strategy in a 
reachability game. Following this strategy guarantees player / to reach a node in U eventually, regai^dless 
of the opponent's choices. Let AttRg,, (?/, cj) = (W, t) denote the /-attractor of U s.t. T is an /-winning 
strategy on W . Note that the /-attractor can be computed in UUvew^^D- 
An important property that has been noted before |[T4l [T2l is that removing the /-attractor of any set 
of nodes from a game wiU still result in a total game graph. 

2.2 Global Strategy Improvement 

We recap global SI lITSllTOl . It defines a valuation of strategies and a partial order on them. The algorithm 
then starts with an arbitrary strategy, computes its valuation, improves it accordingly and iterates this until 
no improvement is possible anymore, which means that winning regions and strategies have been found. 

Strategy valuations can be defined in different ways, but usually through node valuations for each 
node V. The latter are totally ordered and induced by a path starting in v leading to a cycle. The valuation 
process then selects for each node v the node valuation associated with a path n ending in a cycle starting 
in V conforming to the strategy a that yields the worst valuation. 

Schewe LIOJ uses a simple valuation by considering an escape sink transformation of the given parity 
game: all priorities are increased by 2, two new self-cycle nodes with priorities and 1 are added to 
the graph and each player node is connected to the 1 and each player 1 node is connected to the 
0. The transformed game is equivalent to the original game, but now the only occurring cycle nodes 
in valuations are and 1 (assuming that one starts with a reasonable initial strategy). Schewe's node 
valuation therefore is either the set of nodes leading to the unprofitable self-cycle or 00, indicating that 
the node is already won by the a-player following a. 
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We follow this approach, but instead of adding additional self-cycles, we handle partial strategies a: 
whenever the strategy is undefined for a certain node v, we identify this with the decision to move to the 
imaginary self-cycle. 

Let G be a parity game. The relevance ordering < on V is induced hy Q.: v <u: Q.{v) < Q.{u); 
we extend < w.l.o.g. to an arbitrary but fixed total ordering. Additionally one defines the total reward 
ordering w.r.t. player q onV: v holds iff one of the following conditions applies. 

1. either Q.{u) =2 q and v <u 

2. or Q.{u) =2 q and Q.{v) ^2 (} 

3. or Q.{u) ^2 <], 0.{v) ^2 <] and u <v 

The length of a loopless path n is denoted by |7r|. The set of loopless paths ;r in G starting in v is denoted 
by Ug{v). 

The set of node valuations w.r.t. G, is defined as ^ := {00} u2^, i.e. a node valuation is either 00 
or a set of nodes. Given a player q, we extend the reward ordering to a total ordering on node valuations. 
LetM,A^cy withMT^iV. 



M^qN 



{max< (MAN) G A'^ and max< (MAN) =2 q) or 
{max< (MAN) € M and max< (MAN) ^2 g) 



where MAN denotes the symmetric difference of both sets. Additionally M ^qoo holds for every M C V. 
A loopless non-empty path tt = vq . . . vt-i induces a q-node valuation for the node vq as follows. 



^q{n) 



00 ifVk-l£Vl-q 

{vi I < / < ^} otherwise 



Let be a node valuation and v be a node. We define an addition operation t> © v as follows. 

t?©v := 



00 if = 00 or V € 

U {v} otherwise 



A game valuation is a map S : V — > ^ assigning to each v € V a node valuation. A partial ordering on 
game valuations is defined as follows: 

S <^qE' : ^ (E(v) ^q S'(v) for all v£V) and (S / S') 

A strategy a of player q therefore can be evaluated as follows: 

Eq^a : V mm{T^q{n) \ n G nG|^(v)} 

A game valuation S induces a total counter-strategy Ts of player 1 — ^ by selecting the least profitable 
strategy decision with respect to S: 

where Uz{v) = {u ^ vE \ \lw ^ vE : E(m) S(w)}. 

If S originates from a strategy a, Ts can be seen as the best counter-strategy against a; we also write 
Tfj for Ts„. 
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Given a game valuation E and a strategy a for player q, the set ofE-consistent nodes, CoiiG^(a,S) 
is defined as follows. 



ConG,<,(a,S) := {v G Vondom(a) I S(v) =S(a(v)) ev}U 
{v G Vi)\dom(a) I S(v) = {v}}U 
{v G Vi I S(v) = min{E(M) ev | u G vE} 



Note that for every strategy a it holds that ConG.g(cT,Scj) = V . 

A valuation E originating from a strategy a can be used to create a new strategy of player q. The 
strategy improvement algorithm only allows to select new strategy decisions for player q occurring in 
the improvement arena ^G.q,(y •= (^i ^O) ^i) E' , Q.) where 



Thus all edges performing worse than the current strategy are removed from the game. A player q strategy 
a is q-improvable iff there is a node v G V^, a node u £V with vEu and a(v) 7^ m s.t. S(a(v)) -<q E{u). 
The set of improvable nodes w.r.t. a valuation E is defined as follows. 



An improvement policy now selects a strategy for player 17 in a given improvement arena w.r.t. a valuation 
originating from a strategy. More formally: an improvement policy is a map J^^.g • -^qi^) — > '^q{G) 
fulfilling the following two conditions for every player q strategy a. 

1. For every node v G it holds that (v, J^g,^(o')(v)) is an edge in i2/G,<?,(T- 

2. If a is (^-improvable then there is a node v G s.t. E(j(cj(v)) -<q E(j(J^g(<7)(v))- 

Updating a strategy according to an improvement policy can only result in strategies with better val- 
uations (w.r.t. <l^) IIT3I . If a strategy a is not (^-improvable, the strategy iteration comes to an end. 
Winning regions and strategies have then been found in the following sense. = {v | SG,^.fj(v) = 00}; 
W\-q = {v I SG,ty,f7(v) 7^ 00}; a is a winning strategy for player ^ on W^; T : = is a winning strategy for 
player 1 — ^ on W\-q. 

SI w.r.t. player q starts with the empty initial strategy Ig and updates it according to a given improve- 
ment policy until it becomes non-improvabable. 

Algorithm 1 Strategy Iteration 
1: a ^ Ig 

2: while a is ^-improvable do 
3: O ■<r- =/g(0') 

4: end while 

5: return Wq,W\-q, o and some T as described above 



3 A Local Strategy Improvement Algorithm 

Note that global SI is formulated w.r.t. one player q: starting with a strategy for player q it iteratively 
improves it until it becomes a winning strategy on some part of the game graph. A winning strategy for 




Imp(5_^(E) := {v G Vo I S(v) < max({{v}} U {S(m) v | u G vE}) 
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the opponent on the complementary part is then derived from this. This is counterproductive for local 
solving: if local SI was done for one player only, and the opponent is the winner of the designated input 
node, then the algorithm would have to expand the entire reachable part of the game graph and it would 
therefore be at most as good as global SI. Consequently, local SI is parametrised by an improvement 
policy. 

We fix a parity game G = {V,Vo,Vi,E,D.) and an initial position v* € V for which the winner and an 
associated winning strategy is to be computed. The algorithm maintains two sets Wq and Wi in which 
it collects the winning nodes for players and 1, resp., as well as two winning strategies po and pi on 
these sets. Initially, all of them are empty. 

Local SI is carried out for both players alternatingly on potentially different subgraphs of G. Al- 
though it would be possible to run the strategy iteration only for one player, say 0, it would require the 
algorithm to expand the complete graph that is reachable from v* if player 1 wins the initial position v*. 
Therefore, the algorithm basically alternates the strategy iteration w.r.t. both players indepedently. 

For each player q, it maintains initially empty node sets Uq — called the q-subgraph — and Eq ini- 
tialised to {v*} — called the q-expansion set — with the following properties. The <7-subgraph represents 
a local view onto the game G for player q. The ^-expansion set contains nodes which lie just outside 
of the ^-subgraph and which can be used in order to expand the latter. These components satisfy the 
following invariants. 

1. Subgraph-Closure (SC): for every v ^Uqr\'V\-q it holds that vE\{Wq\JW\) C Uq, this ensures that 
for every opponent node in the ^-subgraph all possible choices are also included in the set. 

2. Winning-Closure (WC): An:KQ\^j^^^^^ q(Wq,Pq) = (W^,p^); this ensures that the set of known win- 
ning nodes is closed under attractor strategies w.r.t. the known subgraphs. 

3. Winning-Exclusion (WE): Uqf] (WqUWi) = 0; this ensures that both winning sets cannot be 
reached by a trivial strategy by one of the two players. 

4. Border-Expansion (BE): {m G \ (Wq U Wi U ?7^) | v G ?7,J C C y \ (Wb U Wi U Uq); this ensures 
that all nodes which are not in the ^-subgraph but can be reached from it via one edge are in Eq. 

Additionally, for each player q we maintain a set Iq — called the q-improvement set — , a valuation Eq for 
G\ug, a ^-strategy Oq and another set Cq — called the q-change set. They satisfy the following invariants. 

1. Subgame-Strategy (SS): dom(a^^) Uran(a^) C Uq', this ensures that the strategy associated with the 
subgame is not leading out of the subgame itself. 

2. Valuation-Consistency (VC): Uq\CoB.G^q{aq,'Eq) Q Cq CI Uq', this ensures that the ^-change set 
contains at least all these nodes of the ^-subgraph that have an inconsistent valuation. 

3. Improvement-Covering (IC): Iq \ Cq = lmpQ^{Eq) \ Cq; this ensures that the ^-improvement set 
contains all improvement nodes of the ^/-subgraph disregarding the ^-change set. 

Initially, Iq, Cq as well as Eq and Gq are empty. Finally, the algorithm maintains a current player q* 
initialised arbitrarily. 

It is parametrized by the following policy that is allowed to access all data that is maintained by the 
algorithm. 

1. Improvement Policy: a function ImprPol that returns a non-empty set of pairs (v, w) s.t. v G 
Uq* nVq*, u G Uq* and (v, m) is a proper improvement edge w.r.t. Eq*. 

Whenever the algorithm encounters a new set of winning nodes, it calls the function WINNING with 
parameters q, W and p where q is the winning player, W is the new set of nodes won by q and p is an 
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Algorithm 2 Update of Winning Set 

1: procedure Winning(^, W, p) 

3: Wq^WqU W 

4: p^^PyUp' 

5: S^UvGW'-^v 

6: {Io,h)^{Io\W',h\W') 

1: iEo,Ei)^{Eo\W',Ei\W') 

8: (Co,Ci) ^ ((Co \ W) U {UoHB), {Ci\W') U ([/j nS)) 

9: ([/o,[/i)^([/o\W,[/i\W') 
10: (Eo,Si)^(Eo\W',Ei\W') 
11: {Oo,Oi)^{Oo\W',Oi\W') 

12: end procedure 



associated winning strategy on IV. It computes the ^-attractor of the given winning set in the subgraph 
UqUUi, adds it to the winning set of q and updates all other sets accordingly. In particular; the change 
sets are updated to contain the border nodes in the respective subgraph. 

In order to refer to the data before and after the call, we refer to the unchanged name of a data variable 
for its value before the call and to the f-superscript for its value after the call. 

Lemma 3.1 Let (SC), (WC), (WE), (BE), (VC) and (IC) hold for both players. Let q be a player, W C [/^ 
be a set and p be a winning strategy for q on W. Then, calling WlNNlNG(<7,VK,p) preserves (SC), (WC), 
(WE), (BE), (VC) and (IC)for both players. Additionally c], = (Q/ \ Wj) U {Uqi n U,,eiy;\w, Ev)for both 
players q' and W C W^. 

Also, W,( C and C [/J, U for both players q'. 

Local SI expands the subgraph Uq* of the current player q* if there are no more improvement edges 
in Uq* and the winner of v* has not been decided yet. Then, it calls a function Expand with parameters q 
and N, where <7 is a player with a non-empty expansion set and A'^ is a non-empty subset of the expansion 
set. This essentially adds the minimal set A'^' to Uq that contains s.t. the new Uq fulfills the Subgraph- 
Closure property again. Additionally, the (^-change set is increased by the set of new nodes; it may 
happen, that expanding Uq reveals new edges that lead to nodes in the winning sets Wq or W\ . The 
Expand routine puts these nodes aside and calls the Winning procedure in the end to update the 
winning sets. Therefore, it is possible that calling Expand actually decreases the size of Uq. 

Lemma 3.2 Let (SC), (WC), (WE), (BE), (VC) and (IC) hold for both players. Let q be a player and 
N (lEqbea set. Then, calling ExPAND(g,Af) preserves (SC), (WC), (WE), (BE), (VC) and (IC)for both 
players. Additionally NQUj^U wj U wj and c]^, = {Cq> \ Wj) U {Uq> D {Jyew\w ^ (U^, \ Uqi) for 
both players q' . 

Also, Wqi C Wj and Uq> C J/J, U Wj for both players q'. 

Whenever the change set Cq of a player q is changed, a re-evaluation process needs to take place 
that updates the valuation Eq accordingly. The local evaluation for player q can be invoked by calling 
Evaluate. During this process, it is possible that some nodes of Uq are identified as being won by 
player q. The function therefore returns a set of nodes newly won by player q as well as a winning 
strategy on this set. It updates all dependent node valuations as long as their valuation is not locally 



O. Friedmann & M. Lange 



125 



Algorithm 3 Expansion of Nodes 



procedure Expand(<7, A'^) 
(%',p^)^(0,±) 
(W/,p()^(0,±) 
while A'^ is not empty do 

V ^ pick from A'^ 

A^^Af\{v} 

Ug^UqU{v} 

E,^E,\{v} 
c ^ chooser of v 

if there is some u G U s.t. vEu then 

w;^w;u{v} 

else if M G Wi_c U for every m with vEu then 

else if c = g then 

Eg^EgU {vE \ {Uq U Wo U W;; U m U W/) ) 
else 

N ^NU{vE\{UqUWoUW^UWiUW'y)) 
end if 
end while 

WINNING(0, Wq', P^) 
WINNING(1, W/, p[) 

end procedure 



consistent. The improvement set is then updated accordingly and the set of nodes that are won by the 
current valuation is returned along with the con^esponding strategy. 

Lemma 3.3 Let ( SC), (WC), (WE), ( BE), (VC) and (IC) hold for both players and let qbe a player Then, 
calling EvaluaTE(<7) terminates and — returning W and p - preserves (SC), (WC), (WE), (BE), (VC) 
and (IC)for both players. Additionally, p is a winning strategy of q on W and Cq = 0. 
Also, Wq' C and Uq' C J/J, U Wj, for both players q'. 

Local SI is then realised by the procedure MAIN which runs for as long as the initial node v* is 
not contained in one of the two winning sets. If both the improvement set and the expansion set of 
the current player are empty we know that the other player wins the whole subgraph. Then it adds the 
entire subgraph to the opponent's winning set. Otherwise, either one of the two sets is non-empty. If the 
improvement set is empty, it expands the subgraph according to the picked non-empty set. Otherwise it 
improves the subgraph according to the improvement policy. Additionally, switching between the two 
players is consulted. Now, it may be the case that the valuation for one of the two players needs to be 
updated. Since an update of the valuation of one player may affect the winning sets of both and hence 
may require another update of either player's valuation afterwards, it re-evaluates for as long as the two 
change sets are non-empty. 
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Algorithm 4 Local Evaluation 
1: procedure EvALUATE(^) 

2: X ^ ± 

3: W ^0 

4: D^0 

5: while Cq is not empty do 

6: V ^ pick from Cq 

7: Q^Q\{v} 

8: ^ {v} 

9: if V e V^y and v G dom(atJ then 

10: ^Eq{aq{v))®V 

11: else if V G Vi then 

12: ^min-<jS^(M) ev I M G vETl?/,/} 

13: end if 

14: if T> /E<^(v) then 

15: S^(v) ^ 

16: D^DU{v} 

17: Cq^CqU {Ev n i/^) 

18: end if 

19: end while 

20: D^DUUveD^v 

21: while D is not empty do 

22: V ^ pick from D 

23: D^D\{v} 

24: if S^(v) = oo then 

25: /,^/,\{v} 

26: W ^ W U {v} 

27: if V G Vq then 

28: T ^ t[v I-)- a^^(v)] 

29: end if 

30: else if v G then 

31: if there is some u GvEHUq with Eg(a^(v)) -^q 'Eq{u) then 

32: Ig^IqU {v} 

33: else 

34: /,^/,\{v} 

35: end if 

36: end if 

37: end while 

38: return W and p 

39: end procedure 



Lemma 3.4 Calling MAIN terminates s. t. there is one player q with v* G Wq and Pq being a winning 
strategy for q on Wq. 
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Algorithm 5 Main 



while V* ^WoUWi do 

if£^* =0and/^* =0then 

else 

if 7,^* = then 

A/^ -^pick non-empty subset of Eq* 

EXFAND{q* ,N) 

else 

I ^lMPRPOL(data) 
for all (v, m) G 7 do 

Oq* (v) ^ u 

Cg* ^ Cg* U {V} 

q* ^l-q* 
end for 
end if 
repeat 

stable ^ true 
for / = 0, 1 do 

(W,p) ^Evaluate (/) 
ifW/0then 
stable false 
Winning(/,IV,P) 
end if 
end for 
until stable 
end if 
end while 



4 Experimental Results 



We report on the performance of the local SI algorithm in practice. We compare it in particular to two 
other algorithms for parity games: global SI by Jurdziriski and Voge llT3l and Stevens and Stirling's local 
-calculus model checker |[TT|. Local SI is supposed to remedy one of global Si's distinct disadvantages, 
hence this comparison. Note that both global SI algorithms |[T3l [TOl perform quite well in practice 
although Zielonka's recursive algorithm |[T4l is usually even a bit better lH. However, it is the case 
that, whenever local SI is faster than global SI, it is also faster than the recursive algorithm. We also 
remark that the recursive algorithm is inherently global in nature, and we know of no idea how to create 
a local variant of this one. Furthermore, the parity game solving algorithm derived from the local model 
checking algorithm is of course a local algorithm already. It is therefore also natural contender for local 
SI. Previous tests have shown that it can be much slower than the global algorithms H|. This comparison 
therefore shows that this is not due to the locality but to the solving technique. 
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Local SI for Parity Game Solving 



Local SI has been implemented in PGSolverQ a publicly available tool for solving parity games 
which is written in OCaml and which contains implementations of the two competing algorithms as well 
as many others. Furthermore, PGSOLVER uses a specialised routine for global solving called generic 
solver. It does not feed the entire game graph to the chosen solving algorithm but performs optimising 
pre-processing steps like decomposition into SCCs and removal of trivial winning regions etc. These 
greatly speed up global solving |l4l|. The runtime results of the global algorithm reported here are obtained 
with all optimising pre-processing steps. On the other hand, these steps do not make sense in the context 
of local solving because the removal of trivial winning regions for instance is in fact a global operation. 
Hence, we do in fact compare local SI as it is against global SI embedded in this optimisaing framework. 

Local SI is run in these experiments with the standard all-max policy on the respective improvement 
sets. 

All tests have been carried out on a 64-bit machine with four quad-core Opteron™ CPUs and 128GB 
RAM space. As benchmarks we use some hand-crafted verification problems and random games. We 
report on the times needed to solve the games (globally vs. locally) as well as the number of nodes in 
the games that have been visited. This is an important parameter indicating the memory consumption 
of the algorithms on these benchmarks. The times for global SI are composed of the time it takes to 
generate the parity game (which is not part of the solving, strictly speaking, but needs to be considered 
too in comparison to the local algorithms since they generate the game while they are solving it) and the 
time it takes to perform the strategy iteration. We only present instances of non-negligible running times. 
On the other hand, the solving of larger instances not presented in the figures anymore has experienced 
time-outs after one hour, marked t- 
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Figure 1: /I -Calculus Model Checking 



/X -Calculus Model Checking. As first benchmark we use the problem of verifying an elevator system. 
It serves n floors and maintains a list of requests which may contain each floor at most once. A detailed 
description can be found in [4]. The task is to verify that all paths satisfy the following fairness property: 
if the top floor is infinitely often requested then it is infinitely often served. This property can be for- 
malised in the modal /i -calculus with alternation depth 2, and the resulting benchmarking parity games 
are obtained as /i -calculus model checking games. 

We consider two different implementations of the system, differing in the way that the request list 
is stored: in FIFO or in LIFO style. Note that the FIFO implementation satisfies this fairness property 
because any requested floor will eventually be served whereas the LIFO implementation does not satisfy 
it. This is because new requests will be served before older ones and therefore requests can starve. 



http : //www . tcs . if i . lmu.de/pgsolver 
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With the FIFO system, both local algorithms need to explore the whole game in order to detect that 
the universally path quantified formula is satisfied or, equivalently, to find a winning strategy for player 
which comprises the entire game graph. It is not too surprising that global SI with all its optimisations 
outperforms both of them in this case. But note that on the LIFO system, both local algorithms explore 
only a tiny pait of the whole game and hence outperform the globally operating algorithm. Moreover, 
local SI is even much faster than the local model checker. See Fig.[T]for all the runtime results. 

LTL Model Checking. We take the well-known dining philosophers problem as a system to be veri- 
fied. It consists of n forks, each of which can be in three different states (on the table or held by the left 
or right philosopher). Transitions are being done by lifting up or placing down one fork. Philosopher / 
eats as soon as he holds two forks. We check the LTL formula \/"=i FGeats, on the system which asks 
for a scheduling of the fork lifting that enables one of them to eventually eat all the time. Note that LTL 
model checking can also be viewed as parity game solving El. 

Here the local algorithms only need to find a particular scheduhng out of all possible ones which 
enables one of the philosophers to eventually eat all the time. Both the local SI and the local model 
checker outperform the global algorithm, again just because they only explore a small part of the whole 
game graph. Furthermore, local SI is remarkably more efficient than the local model checker in these test 
cases, i.e. it explores only a fractional part of the entire game in comparison and therefore needs much 
less time. See Fig.[2]for all the runtime results. 
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Figure 2: LTL Model Checking 



Random Games. Finally, we consider games generated by a random model distributed with PG- 
SOLVER. It essentially creates small random games from a uniform distribution and connects them ran- 
domly in order to obtain a non-trivial SCC structure and therefore to prevent the creation of untypically 
special games H. The initial node is the — arbitrarily chosen — first node in the games representation. 
The results in Fig. [3] show that local SI is again much better than global SI and the local model checker 
on these games. Note that the number of visited nodes given there — as well as the running times — 
is the average over 100 runs and therefore not necessaiily an integer value. Also, these random games 
cannot be created locally. Thus, the times given for all algorithms are the pure solving times without 
generation. 

5 Further Work 

We have presented an on-the-fly strategy improvement algorithm for solving parity games and shown that 
it can outperform existing global and even other local methods by several orders of magnitude. To be 
precise, the local SI algorithm presented here beats global algorithms in cases where a winning strategy 
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Local SI for Parity Game Solving 
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Figure 3: Random Games 



does not stretch over the entire game graph, even when the global algorithms are embedded into the 
optimising framework of the generic engine in PGSOLVER. 

It remains to see which of such optimisations which have been shown to aid global solving signifi- 
cantly m can be used in order to tune local SI, and whether these will also act as optimisations in this 
context rather than a slow-down. For instance, it may be possible to do the SCC decomposition of a 
game graph in a top-down fashion. 

Note that in the expansion step, the local SI algorithm - as it is presented here - picks some node 
with a yet unexplored subgraph behind it and continues the exploration with this node. It is of course 
possibly to parametrise the algorithm with respect to the choice of this node. To be precise, one could 
consider yet expansion policies which - like the improvement policies - guide the algorithms choice of 
such nodes. However, note that the only information that these policies would have to rely on in order 
to choose a good node is a cun^ent node's successor names, priorities and their evaluation. The question 
whether or not one of them is good for the choosing player, however, depends on the structure of the 
subgraph lying behind this node rather than on this information. Naturally, this subgraph is not visible 
to the policy for otherwise the algorithm would not be local. We therefore suggest to simply pick such a 
successor by random choice. 

A natural question that also arises is how to localise other global parity game solvers, in particular 
Zielonka's recursive one |!T4l and Jurdziriski's small progress measures algorithm [7|. We currently see 
no method of doing this in a way that could improve over the global variants Uke local SI improves over 
global SI. 
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